Schemaless Representation of Semistructured Data and Schema Construction
نویسندگان
چکیده
We should consider semistructured data of which have a weak schema information in networked information world. To manage such semistructured data eeciently, this paper introduces a data model for semistructured data and operations for schema construction. We transform semistructured data into structured one by introducing schema construction methodology, compared to the former studies which are fully dependent on schemaless manipulations. For schema construction, we deened operations for building IS-A/IS-PART-OF relationships, collecting data objects to build a primitive class, and merging two data instances or classes.
منابع مشابه
Schemaless Semistructured Data Revisited - - Reinventing Peter Buneman's Deterministic Semistructured Data Model -
This paper reviews the design of data models for semistructured data, particularly focusing on their schemaless nature. Uniform treatment of schema information and data, in other words, uniform treatment of metadata and data, is important in the design of such data models. This paper discusses what data and metadata are, and argues that attribute names, which are usually regarded as metadata, a...
متن کاملSchema Extraction for Semi-Structured Data
The emerging eld of semistructured data leads to new ways of rep resenting data as schemaless or self describing However in many applications data has often some regularity and ignoring the possibly partial structure hinders the abilities to interpret the data and to access them e ciently In this paper we investigate a knowledge based approach for discovering partial implicit structures from se...
متن کاملPathLog: a Query Language for Schemaless Databases of Partially Labeled Objects
In the paper we deal with the problem of modeling and querying information in schemaless databases of partially labeled objects (PLO-DB). Partially labeled objects are used for modeling data within repositories integrating both structured and semistructured data. The proposed PLO (Partially Labeled Objects) data model originates from the OEM data model and extends it by allowing partial labelin...
متن کاملNF-SS: A Normal Form for Semistructured Schema
Semistructured data is becoming increasingly important for web applications with the development of XML and related technologies. Designing a “good” semistructured database is crucial to prevent data redundancy, inconsistency and undesirable updating anomalies. However, unlike relational databases, there is no normalization theory to facilitate the design of good semistructured databases. In th...
متن کاملSchema Profiling of Document Stores
In document stores, schema is a soft concept and the documents in a collection can have different schemata; this gives designers and implementers augmented flexibility but requires an extra effort to understand the rules that drove the use of alternative schemata when heterogeneous documents are to be analyzed or integrated. In this paper we outline a technique, called schema profiling, to expl...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1997